A confidence measure for speech recognition systems based on two maximum entropy approaches
نویسندگان
چکیده
We implement a confidence estimation stage for a speech recognition system by using two maximum entropy models. These models takes information from various sources including a set of scores which have proved to be useful in confidence estimation tasks, and efficiently combine those scores. Two different approaches of the maximum entropy model are developed. First a basic model which takes advantages of smoothing techniques used in a previous work, and second an optimized model, which is designed to hold a set of very few but essential characteristics of the model. Resulting model not only increase the performance of the system, but also reduces dramatically the number of parameters. We take advantages of maximum entropy paradigm by imposing restrictions to the model which are otherwise very difficult to model by using classical maximum likelihood estimation. Both models are evaluated with two different corpus and compared to a model previously developed. Keywords—Confidence estimation, maximum entropy, confidence measures, speech recognition.
منابع مشابه
Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملWord error rate minimization using an integrated confidence measure
This paper describes a new criterion of speech recognition using an integrated confidence measure for minimization of the word error rate (WER). Conventional criteria for WER minimization obtain an expected WER of a sentence hypothesis merely by comparing it with other hypotheses in an n-best list. The proposed criterion estimates the expected WER by using an integrated confidence measure with ...
متن کاملEnhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
In this paper, the robustness of the posterior-based confidence measures is improved by utilizing entropy information, which is calculated for speech-unit-level posteriors using only the best recognition result, without requiring a larger computational load than conventional methods. Using different normalization methods, two posterior-based entropy confidence measures are proposed. Practical d...
متن کاملUsing N-best lists for Named Entity Recognition from Chinese Speech
We present the first known result for named entity recognition (NER) in realistic largevocabulary spoken Chinese. We establish this result by applying a maximum entropy model, currently the single best known approach for textual Chinese NER, to the recognition output of the BBN LVCSR system on Chinese Broadcast News utterances. Our results support the claim that transferring NER approaches from...
متن کاملCRF-based combination of contextual features to improve a posteriori word-level confidence measures
The paper addresses the issue of confidence measure reliability provided by automatic speech recognition systems for use in various spoken language processing applications. In this context, a conditional random field (CRF)-based combination of contextual features is proposed to improve wordlevel confidence measures. More precisely, the method consists in combining phonetic, lexical, linguistic ...
متن کامل